Prokaryotic immune defence with an icrRNA and without Cas6 1 An Active Immune Defence with a Minimal CRISPR (clustered regularly interspaced short palindromic repeats) RNA and Without the Cas6 Protein
نویسندگان
چکیده
The prokaryotic immune system CRISPRCas is a defence system that protects prokaryotes against foreign DNA. The short CRISPR RNAs (crRNAs) are central components of this immune system. In CRISPR-Cas systems type I and III crRNAs are generated by the endonuclease Cas6. We developed a Cas6b-independent crRNA maturation pathway for the Haloferax type I-B system in vivo, that expresses a functional crRNA that we termed independently generated crRNA (icrRNA). The icrRNA is effective in triggering degradation of an invader plasmid carrying the matching protospacer sequence. The Cas6b-independent maturation of the icrRNA allowed mutation of the repeat sequence without interfering with signals important for Cas6b processing. We generated 23 variants of the icrRNA and analysed them for activity in the interference reaction. icrRNAs with deletions or mutations of the 3 ́ handle are still active in triggering a interference reaction. The complete 3 ́ handle could be removed without loss of activity. However manipulations of the 5 ́ handle mostly led to loss of interference activity. Furthermore we could show that in the presence of an icrRNA a strain without Cas6b (∆cas6b) is still active in interference. . Prokaryotes defend themselves against invaders using several different mechanisms to degrade foreign DNA or RNA, one of which is the CRISPR-Cas system (1-6). This defence mechanism progresses in three steps: (1) adaptation, (2) CRISPR RNA expression and processing and (3) invader degradation. During the first step, the cell identifies a new invader and integrates a piece of the invader DNA (termed protospacer) into the CRISPR locus of the host (as soon as the protospacer has been integrated into the CRISPR locus, it is termed http://www.jbc.org/cgi/doi/10.1074/jbc.M114.617506 The latest version is at JBC Papers in Press. Published on December 15, 2014 as Manuscript M114.617506 Copyright 2014 by The American Society for Biochemistry and Molecular Biology, Inc. Prokaryotic immune defence with an icrRNA and without Cas6 2 spacer). An important distinguishing characteristic for the selection of a protospacer in the type I and type II CRISPR-Cas systems, is the protospacer adjacent motif (PAM) (7,8). This motif is located in the invader DNA, directly adjacent to the protospacer. The PAM sequence is important, not only for its selection as a spacer but also for the third step of the process, the interference reaction. In the second step of the defence the CRISPR RNA is synthesized yielding a long pre-crRNA that is processed into the mature functional crRNAs. These short RNAs are essential for the last step, the interference, where they detect the invader sequence and trigger degradation of the invader by Cas proteins (2,9). The prokaryotic immune system comes in a variety of different types that all carry out the same reaction namely the defence against foreign DNA or RNA. Data about the different systems reported show, that although they carry out the same reaction they clearly differ in various aspects of the pathway. The different types of CRISPR-Cas have been grouped on the basis of their various Cas proteins into three major classes: CRISPR-Cas type I, II and III (7), that have been further divided into initially ten subclasses (IA-F; IIA-B; IIIA-B) (7), with the number of newly defined subclasses constantly rising as more data about the systems are reported (10,11). The key element in all CRISPR-Cas defence systems is the crRNA. The biogenesis of the crRNA involves either two or three steps, depending on the system: in all cases the first step is the transcription of the CRISPR RNA locus into a long precursor, the pre-crRNA. The following maturation of the crRNA is catalysed by the Cas6 protein in CRISPR-Cas type I and type III systems. In some type I systems Cas6 is part of the CRISPR associated complex of antiviral defence (Cascade) (12), that consists of different Cas proteins depending on the subtype (2). In contrast in type III system Cas6 is a standalone endonuclease (13,14). Processing by Cas6 within the repeat sequence directly yields the mature functional crRNA in types I-A, I-E and I-F (9). The resulting crRNA consists of an 8 nucleotide repeat derived 5 ́ handle, the invader-targeting spacer sequence and the 3 ́ handle, that contains the remainder of the repeat sequence (Figure 1A) (2). In some type I systems (I-E and I-F) the Cas6 proteins stays bound to the crRNA after processing. In type III systems a second maturation step is observed after Cas6 processing, that shortens the crRNA 3 ́ end and sometimes removes the complete repeat sequence downstream of the spacer (14,15). The initial invader DNA recognition is governed by Watson–Crick base pairing with a 7–10 nt segment of the crRNA referred to as the ‘seed’ sequence (16-19). The seed sequence is involved in initial pairing between crRNA and invader, and allows rapid probing of different regions of cellular nucleic acids. If a perfect match between seed sequence and target DNA is found the remainder of the spacer sequence of the crRNA base pairs with the invader DNA. In the type I-E system the seed sequence is a seven nucleotide long non contiguous sequence between the 5 ́ end of the crRNA-spacer sequence and the invader (17). In the type I-B system this seed sequence is with 10 nucleotides slightly longer (20). An additional prerequisite for the interference is the presence of the PAM sequence in the invader DNA (2). Here we investigate the function of Cas6 in the interference reaction and the essential requirements for the crRNA in the type I-B system of the archaeon Haloferax volcanii. H. volcanii contains only one CRISPR-Cas system (I-B), that consists of eight Cas proteins (Cas1Cas5, Cas6b, Cas7 and Cas8b) and three CRISPR RNA arrays (20). We could previously identify the PAM sequences for this system showing that six different PAMs are active in triggering degradation (21). The Haloferax I-B system has a Cascade like complex, with Cas6b copurifying with the Cas5 and Cas7 proteins and the crRNA (22). It has been shown that the Cas6b protein is involved in crRNA maturation and that the crRNA 5 ́ handles are eight nucleotides long, however different 3 ́ length have been reported (22). We developed here a Cas6b-independent crRNA maturation pathway for the Haloferax type I-B system in vivo, that expresses a functional crRNA that we termed independently generated crRNA (icrRNA). The icrRNA is transcribed with flanking tRNA-like structures (so-called t-elements) that are processed by the tRNA processing enzymes RNase P and tRNase Z (23). The icrRNA is effective in triggering degradation of an invader plasmid carrying the matching protospacer sequence. We show here that a minimal crRNA in the I-B system needs a seven nucleotide 5 ́ handle and does not require a 3 ́ handle at all. In addition, we show that the Cas6b protein is not required for the interference reaction when an icrRNA is present. Prokaryotic immune defence with an icrRNA and without Cas6 3 With the Cas6b-independent maturation pathway developed here, the first in vivo analysis of crRNA characteristics essential for the interference reaction was possible. EXPERIMENTAL PROCEDURES Strains H. volcanii strains H119 (strains used are listed in Table 1), ∆cas6 (ΔpyrE2, ΔleuB, ΔtrpA, Δcas6) (22) and ΔC (ΔpyrE2, ΔleuB, ΔtrpA, ∆C::trpA) (this study) were grown aerobically at 45°C in Hv-YPC medium (21). H. volcanii strains ∆cas6 and ΔC containing plasmids were grown in Hv-Ca or Hv-min medium with the appropriate supplements. E. coli strains DH5α (Invitrogen) and GM121 (24) were grown aerobically at 37°C in 2YT medium (25). Construction of plasmids and transformation of H. volcaniiThe plasmids for expressing icrRNA (pTA409-telecrRNA, pTA232telecrRNA and telecrRNA variants in both vectors) were generated as follows (plasmids are listed in Table 1). The DNA fragment containing the crRNA or crRNA mutants flanked by t-elements were ordered from GeneArt® as plasmids pMA-RQ-telecrRNA, pMA-telecrRNA. Plasmids contained a synthetic Haloferax promoter (Anice SabagDaigle and Charles J. Daniels, in preparation), the crRNA, flanked by t-elements and a synthetic Haloferax terminator (Anice SabagDaigle and Charles J. Daniels, in preparation). Plasmids were digested with KpnI and BamHI to isolate the DNA fragment containing the complete insert. The resulting fragment was cloned into pTA409 (26) and pTA232 (27) (both digested with KpnI and BamHI). Four crRNA mutants were generated by inverse PCR on pMA-telecrRNA using primer pairs (primer sequences are listed in Table 1) itele1/del1, itele1/del1, itele1/del1 and itele1/del1 to generate variant 13 (deletion of the last five nucleotides of the 3 ́ handle), 14 (deletion of the last ten nucleotides of the 3 ́ handle), 15 (deletion of the last fifteen nucleotides of the 3 ́ handle) and 16 (deletion of the last twenty nucleotides of the 3 ́ handle), respectively. In preparation for transformation all plasmids were passaged through E.coli GM121 cells to avoid methylation. Haloferax cells were subsequently transformed using the polyethylene glycol method (27,28). Generation of a CRISPR locus C gene deletion strain (∆C)The deletion of the CRISPR locus C was achieved by using the pop-in/pop-out method as described previously (24,25,29). The region upstream of the gene for CRISPR locus C was PCR amplified with flanking regions from the chromosomal DNA of H. volcanii strain H119 using primers Cdelup (containing the restriction site KpnI) and Cdelupi (containing the restriction site EcoRV). The resulting 300 base pair PCR fragment was subsequently cloned into the vector pTA131 (digested with KpnI and EcoRV), yielding pTA131-Cup. Next the region downstream of the locus C gene was amplified using primers Cdeldo (containing the restriction site XbaI) and Cdeldoi (containing the restriction site EcoRV). The resulting 500 bp fragment was cloned into the plasmid pTA131-Cup (digested with EcoRV and XbaI), yielding plasmid pTA131-Cupdo. This plasmid was digested with EcoRV to insert the marker gene trpA (coding for tryptophan synthase A). The tryptophan marker trpA was amplified using plasmid pTA132 (27) as template and oligonucleotides TRP1/TRP2, cloning of the trpA marker gene into the plasmid pTA131Cupdo resulted in pTA131-CupdoTrp. Plasmids were passaged through E. coli GM121 to prevent methylation, and H. volcanii strain H119 was subsequently transformed with this construct to allow integration (pop-in) of the plasmid into the genome. The subsequent selection for loss of the pyrE2 marker by plating on 5-fluoroorotic acid (5-FOA) revealed pop-out mutants. To confirm the removal of the gene for CRISPR locus C we performed a southern blot analysis. Chromosomal DNA was isolated from the wild type and potential locus C gene deletion mutants. Southern blot hybridisation was performed as described (27), with the following modifications: 10 μg of SacII-digested DNA was separated on a 0.8% agarose gel and transferred to a nylon membrane (Hybond-N, GE Healthcare). A 250 bp fragment of the downstream region of locus C was amplified using primers Cdeldoi and DOmitteC, the fragment was radioactively labelled using α-P-dCTP and random prime kit ReadiprimeII (GE Healthcare) and subsequently used as a hybridisation probe (Suppl. Figure 5). Plasmid invader testsThe invader plasmid construct pTA352-PAM3CSp1 (30) and pTA409-PAM3CSp1 (16) were generated based on the Haloferax shuttle vectors pTA352 (pHV1, leuB) (31) and pTA409 (pHV1, pyrE2) (26) including spacer 1 of the CRISPR locus C (C1) and the PAM sequence TTC (PAM3)(16,21). As a control reaction Prokaryotic immune defence with an icrRNA and without Cas6 4 Haloferax cells expressing the icrRNA (wt or mutants) were transformed with the vector without insert (pTA352 or pTA409). Plasmids were passaged through E. coli GM121 cells (to avoid methylation) and were then introduced into Haloferax cells using the PEG method (27,28). To confirm the identification of a functional invader sequence, H. volcanii cells were transformed at least three times with the plasmid invader construct or the control vector. For plasmid invader tests transformations with at least a 100-fold reduction in transformation rates are considered successful interference reactions (21,32). High reductions in transformation rates provide evidence for high targeting efficiency of the crRNA analysed. Northern blot hybridisationTotal RNA was isolated if not stated otherwise from exponentially growing H. volcanii cells as described (16). After separation of 10 μg RNA (total RNA) on 8% denaturing gels, RNA molecules were transferred to nylon membranes (Hybond-N+, GE Healthcare) and incubated with oligonucleotides against the spacer 1 from locus C (primer C1). The primer was radioactively labelled at the 5 ́ end with γ-PATP and subsequently used for hybridisation. Investigation of icrRNAsTo determine the exact length and sequence of the crRNA, RNA was isolated from wild type Haloferax cells (H119) and strain ∆C x pTA232-telecrRNA grown to an OD 0,74. RNA was separated on 8% PAGE and RNA ranging in size from 45-55 nucleotides (fraction 1) and from 60-75 nucleotides (fraction 2) was eluted and sent to vertis Biotechnologie AG for cDNA preparation and RNAseq analysis. The RNA samples were first treated with polynucleotide kinase (PNK) and then poly(A)-tailed using poly(A) polymerase. Afterwards, an RNA adapter was ligated to the 5'-monophosphate of the RNA. First-strand cDNA synthesis was performed using an oligo(dT)-adapter primer and the M-MLV reverse transcriptase. The resulting cDNAs were PCR-amplified to about 10-30 ng/μl using a high fidelity DNA polymerase. The cDNAs were purified using the Agencourt AMPure XP kit (Beckman Coulter Genomics) and were analysed by capillary electrophoresis. For Illumina sequencing, the cDNA samples were mixed in approximately equal amounts. An aliquot of the cDNA pool was analysed by capillary electrophoresis. The primers used for PCR amplification were designed for TruSeq sequencing according to the instructions of Illumina. RNAseq mappingFirst, original reads were trimmed according to their sequencing quality using the fastq_quality_trimmer program from the FASTX-Toolkit version 0.0.13 with the options "-t 13 -Q 33" (http://hannonlab.cshl.edu/fastx_toolkit/). The parameter -Q is required due to the ASCII offset of 33 used for the quality scores in the Sanger format. The estimated probability that a base call is incorrect (p>0.05) correspond to quality values below 13 (33). Second, trimmed reads were mapped with Segemehl (34) version 0.1.3 with the options "--polyA --prime 'AGATCGGAAGAGCGTCGTGTAGGGAAA GAGTGTAGATCTCGGTGGTCGCCGTATC ATT'". This setting removes the poly(A) tail and the 3' Illumina sequencing adapter. The following percentages of the original reads were successfully mapped from each sample: 86 % for S1 (wild type RNA fraction of 60-75 nt length), 74 % for S2 (wild type RNA fraction of 45-55 nt length), 91 % for S3 (icrRNA fraction of 60-75 nt length), and 81 % for S4 (icrRNA fraction of 45-55 nt length). All samples had 20-40 million reads. Subsequent to mapping, alignments were filtered such that they had a maximum edit distance of 2, were located on the reverse strand (because CRISPR locus C is transcribed from the reverse strand), and matched uniquely to the genome. The filtering produced a clearer signal, but did not change original profiles. To explore and display RNAseq results, we used the Integrative Genomics Viewer (IGV) version 2.0.3 (35). RESULTS To determine the essential nucleotides of the crRNA for the interference and to investigate whether the Cas6b protein is required for the interference reaction we established a Cas6bindependent crRNA generation in H. volcanii. Using this set up, we could study the effect of crRNA mutants on the interference reaction independently of the crRNA processing stage; thus we captured crRNA characteristics, which were specific to the interference reaction. Cas protein independent generation of crRNAs We generated a plasmid that encodes the crRNA as well as t-elements directly upand downstream of the crRNA (Figure 1B). The crRNA is derived from the Haloferax CRISPR locus C and contains spacer 1 of this locus. The Prokaryotic immune defence with an icrRNA and without Cas6 5 t-element is a tRNA like structure that has been previously detected directly upstream of the 5S rRNA in H. volcanii, it is processed by tRNase Z to generate the 5S rRNA 5 ́ end (36,37). Generally t-elements are substrates for both tRNA processing enzymes, the 5 ́ processing enzyme RNase P and the 3 ́ processing enzyme tRNase Z (36,38). Processing of the t-elements upand downstream of the crRNA should yield the mature icrRNA. We cloned the crRNA/telement insert into the Haloferax vector pTA409 (26), yielding pTA409-telecrRNA. A Haloferax strain that has the CRISPR locus C deleted (strain ∆C) was generated to get a strain without the endogenous spacer1 from locus C (Experimental Procedures). This strain was transformed with plasmid pTA409-telecrRNA yielding ∆C x pTA409-telecrRNA. Northern blot analysis showed that an icrRNA is generated with the same size as the crRNA made in the wild type strain (that generates the crRNA from the CRISPR locus C) (Figure 1C). Thus the icrRNA is efficiently generated from the plasmid. In addition some shorter RNAs are visible, these shorter crRNAs have also been reported earlier in wild type cells (22). Since the amount of icrRNA was rather low compared to the endogenous crRNA we cloned the crRNA/t-element insert into a Haloferax vector with a higher copy number: pTA232 (27), yielding pTA232-telecrRNA. Northern analysis showed that a Haloferax ∆C strain transformed with pTA232-telecrRNA generated indeed higher amounts of icrRNA (Figure 1C). To confirm that processing of the icrRNA yielded exactly the same 5 ́ and 3 ́ ends as in the "natural" crRNA production we isolated the two RNA fractions that contained the long crRNA of about 65 nucleotides (RNA fraction of 60-75 nucleotides length isolated) and the shorter crRNA of about 51 nucleotides (RNA fraction of 45-55 nucleotides length isolated) from wild type Haloferax cells and ∆C x pTA232-telecrRNA cells and analysed them with RNAseq. The icrRNAs from the 60-75 nucleotides fraction (isolated from ∆C x pTA232-telecrRNA strain) have exactly the same 5 ́ and 3 ́ ends as the wild type crRNA (Figure 2A). Thus we could show that we can generate a mature icrRNA identical to the "natural" crRNA in Haloferax cells. In addition we could show that a slightly shorter icrRNA version with 49 nucleotides length (icrRNA) is also present (Figure 2B). This shorter icrRNA has the same 5 ́ end but a 17 nucleotides shorter 3 ́ handle than icrRNA. The only difference between the "natural" crRNA and the icrRNAs is the nature of the processing product end groups: the icrRNA contains a 5 ́-phosphate group at the crRNA 5 ́ end and a 3 ́ hydroxyl group at the crRNA 3 ́ end due to processing by RNase P and tRNase Z (23,39). This is in contrast to the observed end groups generated naturally by type I Cas6 processing: a 5 ́ hydroxyl group and 2 ́3 ́ cyclic phosphate (I-C, I-E) (13,40,41) or a non cyclic 3 ́ phosphate (I-F) (18). However we show here, that the nature of the end group is not important for the interference reaction (see below). Taken together we could successfully establish a Cas6b-independent crRNA maturation pathway. icrRNAs are active in interference To investigate whether the icrRNA is active in interference we challenged Haloferax strain ∆C expressing the icrRNA (from the high copy plasmid pTA232-telecrRNA) with an invader plasmid (21). The invader plasmid contains the protospacer sequence that matches spacer 1 of CRISPR locus C from Haloferax, thus this sequence can be detected by the icrRNA. Adjacent to the protospacer is the PAM sequence TTC that is one of the six PAMs shown to be active in Haloferax to trigger degradation (21). If this invader plasmid is recognised as an invader it is degraded by the defence system and cells cannot grow on selective medium. Transformation rates of strains transformed with the invader were reduced more than 100 fold compared to transformation with a control plasmid, showing that the invader plasmid is recognised and degraded (Table 2). The same experiment was subsequently carried out with the low copy icrRNA plasmid (pTA409-telecrRNA). Again the transformation rates were reduced in comparison to a control plasmid, showing that the lower levels of icrRNA can also trigger the interference reaction. Taken together, the icrRNAs can trigger the interference reaction and thus are fully functional crRNAs. Cas6b is not required for interference in the presence of icrRNAs In the wild type situation Cas6b is required for crRNA production and it is conceivable that it could also be required for the interference reaction, since it was shown to be part of Cascade in Haloferax (I-B system), E. coli (IE), Pseudomonas aeruginosa (I-F), and Sulfolobus solfataricus (I-A) (18,22,42-46). By the Cas6b-independent generation of icrRNAs, we separated the role of Cas6b in crRNA Prokaryotic immune defence with an icrRNA and without Cas6 6 processing from its function in the interference reaction. Using icrRNAs, we can now determine whether Cas6b is also important for the interference reaction. Thus we transformed a ∆cas6b strain with pTA232-telecrRNA and subsequently with the invader plasmid. The transformation rate of these cells was greatly reduced (by factor 0.0006) (Table 2), showing that the interference reaction works without Cas6b. In the ∆cas6b strain no internal crRNAs can be generated, thus the only crRNAs present in these cells are the icrRNAs. Subsequently Cascade complexes can only be loaded with icrRNAs. This might explain the greater reduction in the transformation rate compared to ∆C: all Cascades in ∆cas6b contain the icrRNA directed against the invader plasmid, while in ∆C only a percentage of the Cascade complexes are loaded with an icrRNA, since the crRNAs from CRISPR locus P1 and P2 are also present. Taken together the Cas6b protein is not required for the interference reaction when the icrRNA is present. Essential features of the crRNA 5 ́ handle Since the icrRNA was proven to be identical to the "naturally" expressed crRNA and to be fully active in interference, we generated different versions of the icrRNA to analyse the essential features of a crRNA for the interference reaction. To identify the important nucleotides of the 5 ́ handle we generated ten different variants and analysed them for activity in the interference reaction (Table 3). All variants were transformed into strain ∆cas6b that was subsequently challenged with the invader plasmid. First we mutated the first nucleotide of the crRNA (which is a G) to a A, U or C (variants 4-6). Mutation of the first nucleotide (position -8) results in icrRNAs that are as effective in interference as the wild type icrRNA (Table 3). This is in agreement with the in vivo situation in Haloferax, where the crRNAs are generated from three different CRISPR loci, that each have a different nucleotide at position -8 of the 5 ́ handle (Figure 1A). Second, the -1 nucleotide was mutated from C to U, G and A (variants 8-10). This nucleotide has been shown in E. coli (type I-E) to be derived from the invader (47-49). In Haloferax the nucleotide -1 is a C and thus also identical to the last nucleotide of the PAM used in this study (TTC). It has therefore the potential to base pair with the invader (Figure 3A). Mutation of this nucleotide to a U does not interfere with the defence activity. The U at this position could still base pair with the complementary PAM sequence in the invader (U-G base pair) (Figure 3B). Mutation of the -1 nucleotide to a G however abolishes the defence activity, this nucleotide could not base pair anymore with the complementary PAM sequence (GxG) (Figure 3C). Surprisingly, mutation of this nucleotide from C to an A does not interfere with the defence activity, although an A at this position is not able to base pair with the complementary PAM sequence in the invader (GxA) (Figure 3D). Since the nature of the first crRNA nucleotide is not important, we next deleted this nucleotide, generating an icrRNA that is still active in interference. Deletion of the first two nucleotides results however in an icrRNA inactive in interference. A deletion of three nucleotides in the 5 ́ handle (positions -6 to -4) (variant 7) is not tolerated. The complete removal of the 5 ́ handle (variant 3) results in a crRNA that cannot trigger the interference reaction anymore. Taken together mutations in the 5 ́ handle are tolerated at the first nucleotide (position -8) and to some extent at position -1. Only the deletion of the first nucleotide of the 5 ́ handle is tolerated, all other deletions result in inactive icrRNAs. Essential features of the crRNA 3 ́ handle The crRNA 3 ́ handle in Haloferax has the potential to form a short stem loop structure at the very 3 ́ end (Figure 1A). To determine whether parts of this stem loop are required and to define the essential features of the 3 ́ handle we constructed thirteen icrRNA variants with mutations in the 3 ́ handle and analysed their activity in interference (Table 4). We mutated a nucleotide in the loop of the potential stem loop structure (G to C or U)(variants 11-12). These variants were both still active in triggering the interference reaction. The removal of four nucleotides of the 3 ́ handle in variant 1 (positions 8-11 in the 3 ́ handle) also did not interfere with the interference reaction. Likewise did the removal of eleven nucleotides in variant 2 (positions 1-11) not reduce the interference. The nature of the 3 ́ handle differs from CRISPR system to CRISPR system. In Haloferax wild type cells two types of crRNAs are observed having a 3 ́ handle of approx. 22 nucleotides and of approx. 5 nucleotides (22). A similar observation was made with the icrRNA, since a long and a short icrRNA can be detected (Figure 1C and Figure 2B), that contain a 22 nucleotide and a five nucleotide 3 ́ handle Prokaryotic immune defence with an icrRNA and without Cas6 7 (Figure 2B). To investigate how many nucleotides can be removed from the 3 ́ handle we designed several 3 ́ handle deletion variants. The five terminal nucleotides were deleted in variant 13, ten terminal nucleotides were removed in variant 14, the last 15 and 20 nucleotides were deleted in variant 15 and 16, respectively. The interference tests clearly show that all four deletions in the 3 ́ handle had no effect on the interference activity (Table 4). In variant 20 only one nucleotide of the 3 ́ handle remained, but still this crRNA was effective in triggering the interference reaction. This last nucleotide was mutated in variants 21-23 from a G to a C, A or U. Again all variants were still active. Even a complete removal of all 22 nucleotides (variant 19) did not interfere with the interference reaction. These results also suggest that the exact length of the complete crRNA is not important, since different lengths at the 3 ́ handle are tolerated. DISCUSSION We could successfully establish a Cas6bindependent crRNA maturation pathway in Haloferax cells. In this pathway icrRNAs are excised from a precursor with the help of tRNA processing enzymes, resulting in small RNAs active in the interference reaction. The icrRNAs are identical to the "natural" crRNAs except for the nature of the end groups. Cas6b is only required for crRNA maturation in type I-B Using the independently generated crRNA we could show that Cas6b is not required for any other reactions besides crRNA processing in the prokaryotic immune system I-B. As soon as the crRNA is generated without Cas6b this protein is dispensable, since it is not required for the interference reaction. We previously showed that Cas6b copurifies with Cascade in Haloferax (22), this observation might be due to the fact, that the crRNA is incorporated into Cascade and that Cas6b is still bound to the crRNA thereby co-purifying with the FLAG tagged Cas7. But although it copurifies with Cas5 and Cas7 it is not required to be part of the I-B Cascade for activity. Thus the core part of the I-B Cascade seems to consist of Cas5, Cas7 and the crRNA. These results are confirmed by the observation that the 3 ́ handle can be completely removed. Thus if the Cas6b protein is attached to Cascade via binding to the crRNA 3 ́ handle, this interaction is not essential. Essential parts of the 5 ́ handle Recent reports on the structure of the E. coli Cascade complex revealed, that the first seven nucleotides of the crRNA 5 ́ handle form a hook that interacts with the Cas5, Cas7 and Cse1 proteins (the homologous protein in Haloferax would be the Cas8b protein) (42-44). Our data clearly show that in the Haloferax I-B system the 5 ́ handle is also an important part of the crRNA. Only the first nucleotide of the 5 ́ handle can be mutated and deleted without loss of activity. This is in agreement with the in vivo situation where 3 different 5 ́ handles are generated (Figure 1A.). In the structural analyses reported for the I-E Cascade complex, the first nucleotide of the 5 ́ handle interacts with Cas5 and Cas7 (42-44). In the Haloferax system this interaction does not seem to be crucial for the activity. However, all other deletions in the 5 ́ handle abolished interference activity: deletions of the first two nucleotides, of three internal nucleotides and of all 5 ́ handle nucleotides yield a non functional crRNA, confirming the importance of the 5 ́ handle. Interaction of the crRNA 5 ́ handle with the protospacer adjacent motif The nature of the last nucleotide of the 5 ́ handle (position -1) seems to be important: mutation of this nucleotide from C to G results in loss of activity, only nucleotides C, A and U are tolerated at this position. In E. coli it has been shown that the -1 crRNA nucleotide is identical to the last PAM nucleotide and is derived from the invader (47-51), thus the crRNA could base pair with the invader at this position (Figure 3). It is not known whether the crRNA 5 ́ handle nucleotide (position -1) stems from the invader in the Haloferax I-B system. But the -1 crRNA nucleotide and invader complementary PAM nucleotide (in PAMs TTC and CAC, two of the six PAMs recognised by Haloferax) have also the potential to base pair. This base pair might be important for recognising the correct target DNA sequence. The observation that the -1 nucleotide mutant C to U works but C to G does not work would confirm this hypothesis. However, the result that the C to A mutation is still active in interference does not fit. In addition, the complementary nucleotide of the other four PAMs recognised in the Haloferax system (TAT, TAA, TAG, ACT) cannot base pair with the crRNA. In the I-E and I-F E.coli system it has been shown that the interaction between the -1 crRNA nucleotide and the last complementary PAM nucleotide is not essential Prokaryotic immune defence with an icrRNA and without Cas6 8 for invader recognition (52,53). The recent structural data for the I-E Cascade complex confirm this earlier observation showing that in this system the -1 nucleotide of the crRNA is displaced by the Cas5 protein preventing interaction with the invader PAM sequence. The same displacement of the -1 nucleotide might happen in the Haloferax I-B Cascade. And the loss of activity of the C -> G mutant could be explained by failure of the G to interact properly with the Cas5 protein. In the I-E system Cse1 (the homologous protein in Haloferax is Cas8b) interacts with the PAM sequence and target recognition occurs via identification of the PAM sequence by the Cse1 protein (18,53-55). The same might be true for the Haloferax I-B system but the Cas8b protein should be able to identify six different PAMs: TTC, CAC, TAT, TAA, TAG and ACT. Taken together our results suggest that a G at position -1 cannot interact properly with the Cas5 protein and that the Haloferax Cas8b would have to recognise all six different PAMs. Essential parts of the 3 ́ handle Mutational analysis of the icrRNA showed that the 3 ́ handle of the crRNA is completely dispensable. The shortest icrRNA found in vivo by RNAseq contained a five nucleotide long 3 ́ handle. According to the data reported here, this shorter crRNA version with only 49 nucleotides should also be active, since even an icrRNA with no 3 ́ handle is still active. Previous isolation of crRNAs from the Haloferax Cascade like complex showed that the long and the short crRNA versions co-purify (22). It would be interesting to analyse whether only the short form is the active form and whether the long form has to be activated by 3 ́ processing to yield the short functional form. Currently it is not known which enzyme(s) are catalysing this further trimming of the crRNA 3 ́ end. As soon as this enzyme is identified we could generate a strain that has the gene for this enzyme deleted and analyse whether the icrRNA with a long unprocessed 3 ́ handle is active. A shortening of the crRNA 3 ́ handle has also been reported for the type I-B system of Methanococcus maripaludis and Clostridium thermocellum (56). Thus it seems that in contrast to the I-A, I-E and I-F systems crRNAs of the I-B system are subjected to an additional 3 ́ trimming, as reported for the crRNAs in type III systems (14,15). The nature of the crRNA end group is not important The pre-icrRNA is generated by the tRNA processing enzymes to exactly the same product as the pre-crRNA generation by Cas6b. The only difference between the natural crRNA and the icrRNA is the nature of the 5 ́ and 3 ́ end groups. However in the experiments reported here, the nature of the end groups did not have any effect: neither on the shortening of the icrRNA to icrRNA nor on the interference reaction. Taken together the nature of the end groups seems not important for the interference reaction. The minimal type I-B crRNA Previously published data concerning the requirements for the spacer-protospacer interactions in the Haloferax I-B system showed that a 34 nucleotide long spacerprotospacer interaction between crRNA and invader was sufficient (16). According to these published data and the results reported here the minimal crRNA for the Haloferax type I-B system contains a seven nucleotide long 5 ́ handle, a 34 nucleotide long spacer and no 3 ́ handle (Figure 4). Altogether this crRNA would be 41 nucleotides long. Acknowledgments We are grateful to Charles J. Daniels for providing us with the synthetic promoter and terminator sequences for Haloferax ahead of publication. We thank Elli Bruckbauer for her expert technical assistance and Jutta Brendel for her kind help. We are grateful to all members of the FOR Unit 1680 for helpful discussions. Prokaryotic immune defence with an icrRNA and without Cas6
منابع مشابه
An Active Immune Defense with a Minimal CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) RNA and without the Cas6 Protein*
The prokaryotic immune system CRISPR-Cas (clustered regularly interspaced short palindromic repeats-CRISPR-associated) is a defense system that protects prokaryotes against foreign DNA. The short CRISPR RNAs (crRNAs) are central components of this immune system. In CRISPR-Cas systems type I and III, crRNAs are generated by the endonuclease Cas6. We developed a Cas6b-independent crRNA maturation...
متن کاملRequirements for a successful defence reaction by the CRISPR-Cas subtype I-B system.
Uptake of foreign mobile genetic elements is often detrimental and can result in cell death. For protection against invasion, prokaryotes have developed several defence mechanisms, which take effect at all stages of infection; an example is the recently discovered CRISPR (clustered regularly interspaced short palindromic repeats)-Cas (CRISPR-associated) immune system. This defence system direct...
متن کاملStructure of a dimeric crenarchaeal Cas6 enzyme with an atypical active site for CRISPR RNA processing
The competition between viruses and hosts is played out in all branches of life. Many prokaryotes have an adaptive immune system termed 'CRISPR' (clustered regularly interspaced short palindromic repeats) which is based on the capture of short pieces of viral DNA. The captured DNA is integrated into the genomic DNA of the organism flanked by direct repeats, transcribed and processed to generate...
متن کاملCRISPR interference: a structural perspective
CRISPR (cluster of regularly interspaced palindromic repeats) is a prokaryotic adaptive defence system, providing immunity against mobile genetic elements such as viruses. Genomically encoded crRNA (CRISPR RNA) is used by Cas (CRISPR-associated) proteins to target and subsequently degrade nucleic acids of invading entities in a sequence-dependent manner. The process is known as 'interference'. ...
متن کاملThe RNA- and DNA-targeting CRISPR-Cas immune systems of Pyrococcus furiosus.
Using the hyperthermophile Pyrococcus furiosus, we have delineated several key steps in CRISPR (clustered regularly interspaced short palindromic repeats)-Cas (CRISPR-associated) invader defence pathways. P. furiosus has seven transcriptionally active CRISPR loci that together encode a total of 200 crRNAs (CRISPR RNAs). The 27 Cas proteins in this organism represent three distinct pathways and ...
متن کاملCas6 specificity and CRISPR RNA loading in a complex CRISPR-Cas system
CRISPR-Cas is an adaptive prokaryotic immune system, providing protection against viruses and other mobile genetic elements. In type I and type III CRISPR-Cas systems, CRISPR RNA (crRNA) is generated by cleavage of a primary transcript by the Cas6 endonuclease and loaded into multisubunit surveillance/effector complexes, allowing homology-directed detection and cleavage of invading elements. Hi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014